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ABSTRACT 

The degeneracy between geometric (Alcock-Paczynski) and dynamic (redshift space) 
distortions in the pattern of the galaxy distribution has been a long standing problem 
in the study of the large scale structure of the universe. We examine the possibility of 
lifting this degeneracy and constraining cosmological parameters by using the Baryon 
Acoustic Oscillation (BAO) scale as a feature of known physical size, the sound horizon 
Ts w 150Mpc. We callibrate this scale with the equivalent feature in the Cosmic 
Microwave Background (CMB). First, we construct a toy model of a power spectrum 
which includes the BAO as well as geometric and dynamic distortions. By adding a 
prior onto the sound horizon of ^ 1% we show, using a Fisher matrix analysis, that 
error ellipses for line of sight and tangential distortion parameters shrink by a factor 
of two for a 20{h~^Gpc)^ 'DESpec/BigBOSS'-like galaxy survey including shot noise. 
This improvement is even more marked in smaller surveys. We also carry out a Monte 
Carlo Nested Sampling analysis on our parameter space. We find that Monte Carlo 
and Fisher methods can agree reasonably well for surveys with large volume but differ 
greatly for small volume surveys. 

Key words: (cosmology) large-scale structure of the Universe, distance scale, cos- 
mological parameters 



. ^ 1 INTRODUCTION 



In order to derive knowledge of cosmology from observations 
of the large scale structure of the universe, we must accu- 
rately record the angular positions and distances to billions 
of galaxies and then analyse their statistical distribution. 
To determine distances in cosmology it is first necessary 
to have a fiducial cosmological model that allows the ob- 
server to calculate quantities such as the angular diameter 
distance dA{z) and Hubble function H(z). Inevitably the un- 
derlying model the cosmologist has is incorrect in some way. 
This results in g eometric distortions know n as the Alcock- 
Paczynski effect l|Alcock k, Paczvnskilll97^ ) - distances mea- 
sured along the line of sight look different to those measured 
perpendicular to the line of sight (Nusscr 2005; Blake ct al. 
I2OIII : iKazin. Sanchez fc Blantonll201ll ') . 

The redshift of a galaxy, z, is not a true measure of 
a galaxy's distance but a measure of the recessional veloc- 
ity of the galaxy. Galaxies are involved in motions apart 
from the Hubble flow, such as falling onto a cluster un- 
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der gravity, these res ult in a dist ortion of the perceived 
distance to a galaxy llKaisej [19871 '). Because clusters grow 
under gravity, through studying their growth at different 
redshifts by isolating these redshift space distortions, we 
are able to learn how gravity behaves at different epochs. 
This may help us identify d e viations from General R e lativity 
d Sinipson fc Peacockl I2OI0I : iPercival fc Whitd l2009l : iLindeJ 
200|). It is well known that these two effects are degener- 
ate in some regimes. Dynamic and geometric distortions can 
both have an enhancing effect on the galaxy power spectrum 
in the line of sight. 

In order to identify and study these effects it is neces- 
sary to flnd features that are known to be the same physical 
size in the radial and transverse directions and the same size 
at all redshifts - a so called cosmic ruler. Unfortunately as- 
trophysical objects are neither uniform nor large enough to 
perform such tests. The scale of the Baryon Acoustic Oscil- 
lations (BAO), i.e. the sound horizon at the epoch of last 
scattering, rs, (seen on the CMB sk y and first detected in th e 
SDSS LRG galaxy distribution by lEisenstein et all (|2005l ')') 
can be used as such a standard ruler to te st our understand- 
ing of cosmology (for a recent review see lBassett fc Hlozekl 
(|2009l )V This characteristic clustering scale, which emerges 
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from physical processes in the early universe, should be the 
same in both the radial and transverse directions. 

In the transverse direction, measurement of the sound 
horizon gives a measurement of the angular diameter dis- 
tance at which the sound horizon is observed, dA{z)/rs, 
where 



dA{z) 



Ho l + z' 



where 



r{x) 




flk>0 
flk<0 



(1) 



(2) 



x{z) is the comoving distance 
X{z) = Ho 



dz' 

HIT)- 



(3) 



Thus the Hubble function and angular diameter distance are 
coupled in an FLRW universe. In the radial direction, the 
same BAO measurement gives H{z)rs. Assuming that the 
radiation fraction is negligible the Hubble function is 



H{z) = Ho^/{nm{l + z)'^ + + z)^ + QoEfiz)}, (4) 



where 



fiz) = {l + z) 



{3(l + mO+™o))cxp(-3™aZ/(l + z)} 



(5) 



takes into account an evolving dark energy equation of state, 
w, which can be split into a constant, wo, and time varying 
component, Wa, 

Wa 



■ Wo -f- 



\ + Z 



(6) 



There has bee n some debate regarding the detec- 
tion of radial BAO. iGaztanaga. Cabre fc Hriil (|2009l ) mea- 
sured H(yz') using the BAO peak in the line of si ght two- 
point correlation function. This was refuted bv iKazin et al.l 
(|2010l ) who claimed it was consistent with background 
noise but used by others to get cosmological constraints 
IZhai. Wan fc Zhanj2o"Tol). Criticisms of this detection were 
riposted bv ICabre fc Gaztafiagal (|201ll '). Detections consis- 
tent with background noise can still be used to improve 
cosmological constraints if one assumes the ACDM cosmol- 
ogy. Future projects such as PAtQ, DESpecQ and BigBOS^ 
should provide stronger detections of the radial BAO. 

Here we investigate how calibrating the BAO scale in 
the galaxy distribution to that measured in the CMB can re- 
move the degeneracy between geometric and redshift space 
distortions improving constraints on cosmological parame- 
ters. Both of these probes, i.e. probes sensitive to the back- 
ground expansion and the growth of perturbations, are im- 
portant if one wants to distinguish the effect of dark energy 
from modified gravity. 

Other similar work has been done on this by 
ISimpson fc Peacockl ()2010l l. They focused on looking at dark 
energy versus modified gravity and concluded that CMB 



calibration of the BAO scale was not a particularly power- 
ful way of telling the difference betwee n the two. Work by 
ISamushia. Percival fc RaccanellJ ()201ll ^ is also similar, they 
foc us on constrain i ng cur vature. 

IShapiro et all (|2010| ) make the interesting point that 
often when we consider constraints from combined probes we 
merely over plot the likelihood contours even though CMB 
information is routinely included in each probe. Meaning 
that it is included more than once in the final interpretation 
of the results. This highlights the need to understand CMB 
calibration. 

We look at constraints from a "small" survey of vol- 
ume l(/i"^Gpc)^ and a "large" survey of 20(/i"^Gpc)^. This 
large survey is exemplary of future spectroscopic galaxy 
redshift surveys like BigBOSS, a proposed all sky spectro- 
scopic galaxy redshift survey and DESpec, a proposed spec- 
troscopic follow up to the photometric Dark Energy Sur- 
vejQ. Surveys such as DESpec/BigBOSS will be large and 
deep enough to use redshift space distortions to measure the 
growth of structure in the same redshift bins used to mea- 
sure the Hubble function from radial BAO measurements. 
We make the assumption that if redshift measurements are 
accurate enough to measure the radial BAO scale then the 
precision o f these measurements will have little effect on our 
results fsee lBem'tez et all l|2009l )). We do not, therefore, con- 
sider how uncertainties in redshift measurements will affect 
our results but instead leave this for further work. Nor do 
we consider non linear redshift space distortions. 

In order to get a feel for the effect of CMB calibration 
and to avoid adding CMB information via a "black box" , we 
shall start our investigation with a toy model for the galaxy 
power spectrum (section [2]). This toy model includes the 
BAO and geometric and redshift distortions. Later we will 
investigate the problem with a numerically calculated power 
spect rum, produced by CAMB (iLewis, Challinor fc Lascnb^ 
I2OOOII to allow for a time vary ing dark energy equation of 
state iFang. Hu fc Lewis|[200a ) (section [3|. 

In section|4]we investigate the effect of CMB calibration 
on the parameters used to write our toy model using Fisher 
information. We show how a prior on just the sound horizon 
alone reduces the area of the Fisher matrix Icr ellipse in 
the parameter space describing the geometric and redshift 
distortions. 

In section[5]we investigate how nested sampling explores 
the parameter space and show that derived constraints on 
parameters are not the same. Through the use of impor- 
tance sampling (see section [5] for an explanation) we pro- 
duced combined constraints for our numerical galaxy power 
spectrum and CMB observations (usi ng a publicly available 
WMAP chain (|Komatsu et al.l I2OI0I ')'). We compare these 
constraints with those from the power spectrum and a prior 
on the sound horizon alone finding that they are not the 
same. We present our main results in section [6] and discuss 
their implications in section [T] 



^ http: / / www.pausurvey.org 

^ http: / /eag. fnal.gov/DESpec/Home. html 

^ http://bigboss.lbl.gov 
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2 A TOY MODEL FOR THE OBSERVED 
GALAXY POWER SPECTRUM 

Here we construct a toy model for the observed galaxy power 
spectrum which includes BAO 'wiggles' in addition to ge- 
ometric and redshift space distortions. By design this toy 
model is not explicitly dependant upon cosmology acting as 
a fitting function for the observed anisotropic power spec- 
trum. In reality, of course, the shape and distortions of the 
galaxy power spectrum are cosmologically dependant. The 
aim of this toy model is to focus clearly on quantities that 
are measured from the large scale structure (LSS) two point 
cor relator. Our toy power spe ctrum is similar to that used 
by ISimpson fc Peacockl (|2010l ) but simpler, excluding non- 
linear redshift space distortions. 

The Bardeen, Bond, Kaiser, Szalay (BBKS) trans- 
fer function (equation [T]) gives an analytically simple 
power spectrum wh ich is satisfactory for our purposes 
l|Bardeen et al.ll 19861 ). 

n.) = i^^ii^^x [1 + 3.89. + (16.2,)^ (7) 
+ {5A7qf + (6.71g)'']-°•''^ 

where g = -p, F, is the power spectrum shape parameter. 
This gives us the linear, 'no wiggles' part of our power spec- 
trum. 



Pnbik) = A*k''TBBjfsi- 



(8) 



A* is the amplitude of the power spectrum and is in- 
dependent of cosmology and F is the shape parameter, 
which probes th e matter density and baryon fraction thus 
l|Sugivamalll995h : 



r — flmhexp ( — f2i, — V2h—^ 

J I'm 



(9) 



n is the spectral index, which for a Harrison- Zeldovich spec- 
trum is equal to 1 (inflation typically predicts that this num- 
ber is less than 1). The spectral index is different from the 
other parameters in this power spectrum since it can be con- 
sidered a cosmological parameter in itself and not as such 
dependant on cosmology, we therefore keep it fixed at n = 1. 

An approximation to the baryon acoustic oscillations 
can be added to this by multiplying by a slowly vary- 
ing sinusoidal function whose peaks are harmonics of the 
horizon scale sin(A;rs), with some damping given by some 
exponential part similar t o the fitting function used by 
iGlazebrook fc Blakd (|2005h 



P{k) = Pnb{k) 



l+AsAokexp 



sin(rsfc) 



(10) 

However, this is not how the power spectrum appears 
to the observer. We have to incorporate into our formalism 
descriptions of the linear dynamic and geometric distortions. 
In redshift space, allowing for anisotropics, the power spec- 
trum takes the form 



P"(fc||,fci 



where 



kl + k"^. 



(11) 



(12) 



0.m{z)'' is the equivalent of the logarithmic rate of change 
of the linear growth factor with respect to redshift. 7 is de- 
pendant on the models of gravity and structure formation 
used and can be dependant on redshift. This simple param- 
eterisation of the growth of structure has been shown to 
be effective in capturing the behaviour of a large class of 
models. Standard GR in a ACDM universe gives a constant 
7 = 0.55, whilst the PGP bra inworld model is approximated 
by 7 = 0.68 ( Finder fc Cahnll2007. '). Some theories of gravity 
require a 7 that slowly varies with redshift and in theories 
of gravity which allow for the presence of anisotropic stress 
7 can be scale dependant. When designing experiments to 
measure 7 we will therefore have to consider measuring 7 in 
different redshift bins and at different scales in order to con- 
strain deviations from GR. A simple extension to in clude a 
dependence on the dark energy equation of state is (| Finder! 
i2005.) 



7(2) = 0.55 + 0.05(1 + «;(2)). 



(13) 



b{z) is the bias, a measure of how well the galaxy distribution 
follows the underlying dark matter distribution. In reality 
this will be a function of redshift but for now we shall assume 
that it is a constant and is equal to unity, fcy and k± are 
respectively the parallel and perpendicular components of 
k. 

[Jennings. Baugh fc PascolU l|201 show, using N body 
simulations, that strictly linear redshift distortions are very 
poor at measuring the growth rate in non ACDM cosmolo- 
gies, even at large scales. However, we w ill follow the likes 
of Samushia, Pcrcival fc Raccanelil (l201lf ) in assuming that 
linear redshift distortions are a good enough approximation 
at large scales. 

Considering the Alcock-Paczynski effect we need to in- 
troduce parameters that describe the ratio between the ac- 
tual and assumed cosmologies in the parallel and perpendic- 
ular directions, 



H{z) 



Cxl2 = 



dA{z) 



(14) 



These parameters relate to the Hubble parameter H{z) and 
angular diameter distance dA{z). If our fiducial cosmology 
is correct (and therefore correctly describing our observed 
power spectrum) then cy — c± = 1. Thus by measuring 
the sound horizon in both directions we can determine how 
well we know our cosmology. It should be noted that there 
is still a degeneracy between H{z) and dA since both are 
dependant upon cosmology and are coupled. Meaning that 
a result of cy 7^ cx 7^ 1 does not tell us how our fiducial 
cosmology is wrong, merely that it is. 

The observed power spectrum is then described by 



P'"'%k^^,kj 



C||(z)ci(z) 



P"(fc||,fcx). 



(15) 



Where fcy — fc||/c|| and k± — kj_/c±. The prefactor is needed 
because geometric distortions lead to a misestimation of the 
survey volume. The fiducial values for the parameters in 
this toy model are given in table [T] and the power spectrum 
illustrated in figure [T] 
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Figure 1. The model power spectra used in our analysis, the 
colourbars have units of ln(/i~' IMpc). The top left panel is the 
anisotropic toy model power spectrum (as outlined in section [2]l, 
the top right panel gives the error, tr^, on this model, for the 
20(h-lGpc)3 'DESpec/BigBOSS' survey with shot noise. The 
bottom two panels are the CAMB power spectrum (as outlined 
in section [Sjl and the error bars associated with it. The fcbin size 
is (1.9 X 10~^/iMpc~^)^. The colour scales used in these plots are 
logarithmic. 



In our toy model the amplitude, A* , is a normalising 
factor, as is the amplitude parameter, A, that we vary in 
CAMB. The amplitude of the BAO, Abao, the power spec- 
trum shape parameter, F and sound horizon scale, r^, are 
dependant on the fraction of dark matter and of baryons 
and on h, so we vary rJcdm, ^^b and Ho- We assume a flat 
cosmology, so ficdm + f^b + f^DE = 1 always. We also vary 
the linear bias parameter b because the redshift space distor- 
tion parameter, /3 is dependant on galaxy biasing. All other 
parameter values are kept fixed at the default CAMB values. 

CMB data, such as that from WMAP or that which 
will come from Planck, contains more information than just 
the sound horizon scale and its error. It contains a plethora 
of cosmological information. Information contained in CMB 
observations on the geometry and the make up of the uni- 
verse could also be used to improve parameter constraints. 
For example, the CMB gives us knowledge of Qm, this can 
be used as a prior on /3. Indeed if we want to look for devia- 
tions from GR using measurements of /? it is first necessary 
to understand the matter fraction, Qrn- In this paper we 
shall not investigate how explicit priors on parameters such 
as the matter fraction affect constraints, leaving this for fur- 
ther work. Instead we shall just investigate the prior on the 
sound horizon. 



A* 


Abao 


r 


rs P C|| 


cx 


3.80 


2.00 


0.15 


149.86 0.52 1 


1 



Table 1 . Fiducial values of the parameters of our toy power spec- 
trum used to build the Fisher matrix (equations 1 101 - [ T5)l . 



3 A MORE REALISTIC MODEL FOR THE 
GALAXY POWER SPECTRUM 

The toy model, by design, is not explicitly dependant on cos- 
mology. The distortion parameters can vary independently 
of the parameters that describe the shape of the power spec- 
trum. In reality geometric and dynamic space distortions are 
dependant on the same cosmology that determines the shape 
of the power spectrum, so they cannot vary independently 
of the underlying power spectrum. 

To build a more cosmologically based model we use a 
power spectrum generated by CAMB with geometric and 
redshift space distortions added in the same way as our an- 
alytic toy model. There are many parameters in CAMB that 
one could choose to vary. Here we have chosen parameters 
that can be related to those in our toy model. In particular, 
six of the parameters we have chosen determine the distor- 
tions on the power spectrum. The numerical model power 
spectrum is illustrated in figure [T] and the fiducial values of 
its parameters given in tabled 



^cdm 




^DE 


too 


Wa Ho 


b 


A 


0.218 


0.044 


0.738 


-0.827 


-0.753 71.5 


1.0 


2.3 



Table 2. Fiducial cosmology for our CAMB power spectrum, 
based on the best fit parameters for the WMAP with time varying 
dark energy equation of state. 



4 FISHER ANALYSIS 

The Fisher Matrix formalism is a way of estimating the 
amount of information available in a given parameter space. 
It is a way of propagating errors in the case of multiple cor- 
rela ted measurements an d many parameter s. 

iBassett et all (|2009l ) or lAlbrecht et all (2009) provide a 
more detailed explanation of the statistics behind the Fisher 
matrix and also on ho w they are correctly manipulated. 
ISeo fc Eisensteiiil (|2007l ) look at isolating the information 
cantained in the BAO peaks using Fisher matrices. We try 
to keep our use of Fisher matrices as simple as possible since 
later we will do a statistically more robust analysis. 

The Fisher information is a measure of how much 
information an observable (in this case the power spec- 
trum) contains about some unknown parameter 6 (in this 
case the seven observable parameters in our toy model 
A* , Abao,^ ,ra, I3,c_i_,c\\). It is defined by the expectation 
value of the derivatives of the logarithm of the likelihood 
function, C, with respect to parameters 6i. 



(16) 



Assuming that the noise on our data (the power spectrum) 
is Gaussian. The likelihood function for this observable can 
be expressed in terms of the theoretical value for that ob- 
servable evaluated at some point, in our case at a particular 
k value. 

With some manipulation we can express the Fisher ma- 
trix as a sum over derivatives of the power spectrum with 
respect to parameters 9i rather than as derivatives of the 
likelihood function. 

1 dP{k„)dP{K) 



F 



do. 



(17) 



where <jn is the Gaussian error on P{kn). The error on the 
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power spectrum is as follows. 



5P : 



2 

Ank^AkV V n 



(18) 



Thus for the limit of large n i.e. ignoring shot noise only 
keeping the cosmic variance term 



SP{k„) = 



A-jvk'^AkV 



(19) 



where V is the volume of the survey. We will however al- 
ways include shot noise in our calculations, assuming a con- 
stant number density of tracers n = 5 x 10~^(/t~^Mpc)~^ . 
For a more detailed explanation see iTeemark et al ] l| 19981 ). 
(There is the implicit assumption that uncertainties in the 
measurement of the power spectrum are independent of cos- 
mology. This assumption is incorrect since knowledge of the 
Hubble function and angular diameter distance are needed 
to convert redshifts to galaxy positions and derive a power 
spectrum from observational data.) We then put this into 
equation 1171 (SP)^ = a^, using dP/P = dlnP and recog- 
nising that because the effects on the power spectrum that 
we are interested in are anisotropic we require the full two 
dimensional anisotropic Fisher matrix, we get: 

ainP(fc||,fcx) 91nP(fc||,fcx) 27rAfciAifc|| 

F^3 = 2^ — — TTT^, V. 



{2nf 



(20) 

The logarithmic derivatives of the toy model power spec- 
trum can be found in the appendix. We have assumed here 
that the different k-modes of the power spectrum are uncor- 
related. In reality a given galaxy redshift survey will only 
cover part of the sky and may vary in depth. Thus different 
k-modes of the power spectrum are correlated by a window 
function and are not independant. The assumption of uncor- 
related modes is valid if the fc-bin Afc >> 1/L, where L is 
the smallest linear dimension of the survey, in our case this is 
~ l(/i"^Gpc)^. Weusedafc-bin size of Afc = 1.9x 10"^/iMpc 
which satisfies this criterion. 

In order to reduce the size of our parameter set we have 
two options. Either "fix" the parameters we are uninterested 
in, i.e. assume that we know their values completely. If we 
think we know the value of a parameter completely then 
we can just remove that row and column from the Fisher 
matrix. This is equivalent to adding an infinite prior on 
that parameter. Or we can marginalise over that parame- 
ter. Treating it as a "nuisance" parameter. 

Constraints obtained from a marginalised Fisher ma- 
trix will always be weaker or equal to those from an un- 
marginalised matrix. Marginalisation will have no effect on 
the error on a parameter only if all other parameters are 
completely uncorrelated to it. This assumption breaks down 
because our observed parameters hide shared cosmological 
information and are therefore correlated. For example all 
our parameters except A* are dependant on f2m. 

The procedure we follow to marginalise over unwanted 
parameters is to first invert the Fisher matrix to get the co- 
variance matrix. Then we remove the row and column corre- 
sponding to the "nuisance" parameter and reinvert to obtain 
the new reduced Fisher matrix. This is then repeated until 
we have marginalised over all our "nuisance" parameters. 

In our toy model analysis we demonstrate calibration 




0.4 0.5 0.6 

/9 

Figure 2. An exploration of our toy model parameter space for 
a l(fc~^Gpc)'^ survey, focusing on the linear redshift distortion 
parameter /3 and the line of sight and perpendicular geometric 
distortion parameters, C|| and cj^. Fisher matrix Icr contours with 
(red) and without (blue) a prior on the sound horizon ■ Nested 
sampling Icr contours with (black) and without (green) a prior on 
the sound horizon r^. Small galaxy redshift surveys are unable to 
estimate the values of cosmological parameters acurately alone. It 
is therefore difficult to approximate the errors on these paramters 
as being Gaussian. 



of LSS information to the CMB by the addition of a prior 
on the sound horizon scale r^. Th e sound horizon can b e 
calculated using t he de scription bv lHu fc Sugivarnal l|l996f ): 
lEisenstein fc Hul l|l998l ). 

The prior matrix is simply a diagonal matrix made up 
of Pvi^j = where a is the error on parameter 9i and 

off diagonal components being zero. Adding this matrix to 
the original Fisher matrix then produces a matrix which 
includes the prior information. 

In Figures [5] and [3] we have drawn error ellipses repre- 
senting the constraints on pairs of parameters (marginalising 
over all other parameters) in our toy power spectrum from 
a l{h~^Gpc)^ volume limited survey (blue). The ellipses are 
then redrawn including only a prior on the sound horizon of 
~ 1%, i.e. that — 1.4Mpc/i~^ is a Gaussian error bar. 
This is about the size of the error bar on the the sound hori- 
zon scale from WMAP-7. The shrinking on the cx — cy ellipse 
implies that calibrating the sound horizon from the CMB to 
the sound horizon in the BAO features in the galaxy power 
spectrum can tell us whether we have the correct value for 
the angular diameter distance and Hubble function. The af- 
fect that decreasing the strength of the prior on the sound 
horizon has on constraints in the cx — cy , cx — /3 and cy — /3 
planes is illustrated in figure |4] Note how calibrating the 
sound horizon has next to no effect of our constraints on (3. 

However, within dA{z) and H{z) lie all the standard 
cosmological parameters (the dark energy equation of state 
appears in the Hubble function and in the angular diame- 
ter distance). In order to know how much information we 
have on those parameters from the original toy power spec- 
trum we need to transform the original Fisher matrix. This 
is a straight forward coordinate transformation of a rank-2 
tensor 
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Figure 3. An exploration of our toy model parameter space for 
a 20(/i~^Gpc)'^ survey, focusing on the linear redshift distortion 
parameter /3 and the line of sight and perpendicular geometric 
distortion parameters, cy and c±. Fisher matrix la contours with 
(red) and without (blue) a prior on the sound horizon rg . Nested 
sampling Icr contours with (black) and without (green) a prior on 
the sound horizon rs. 




ar^/(h-'Mpc) 

Figure 4. Area of the Icr Fisher ellipse with respect to cr^^ . Red 
line is cy-/?, green dotted is cx-/3, blue dot dash is cy-cx- The 
vertical black dotted lines show the value for used as a prior 
in figures [2] and |3] and a prior of 0.1%. The blue dotted and green 
dotted lines show the area of c± ~ f} and c± — cy ellipses respec- 
tively, with no prior on rg. 



(21) 



When transforming to cosmological parameters includ- 
ing prior information from the CMB then becomes more 
complicated since many of the cosmological parameters we 
are interested in have priors from WMAP and other CMB 
experiments. They will need to be added to the cosmolog- 
ical parameters Fisher matrix before any further marginal- 
isation if this information is to be included. We could do 
this by transforming an available Fisher matrix for a CMB 
probe into ou r base parameters and add ing it to our Fisher 
matrix as in (jSimpson fc Peacockllioiol ). However this does 



not help us understand the effect of CMB calibration, it 
merely assumes it is a necessary step. Here we are trying to 
investigate the limit of information available before adding a 
prior from the CMB. To investigate cosmological parameter 
space we need a more physical toy model and more robust 
statistical techniques. 



5 NESTED SAMPLING - A MORE ROBUST 
EXPLORATION OF PARAMETER SPACE 

We are interested to see how our toy model with one Gaus- 
sian prior compares to a full likelihood analysis of a numer- 
ically produced power spectrum combined with real CMB 
information. One commonly used metho d in cosmology 
is Ma, rkov- Chain Monte-Carlo (MCMC) bv lLewis fc Bridle! 

Algorithms such as the Metropolis-Hastings algo- 
rithm attempt to optimally explore the parameter space 
without wasteful calculations at uninteresting points. 

Our chosen method of statistical param- 
eter s p ace e xploration is 'Nested Sa m pling ' 
JSkillinel |2004|: iMukheriee. Parkinson fc Liddld l200d : 



iFeroz. Hobson fc Bridged l2009l : iFeroz fc HobsonI |2008| ). a 
completely different algorithm to MetropoHs-Hastings but 
with much the same aims. 

It was first designed for Bayesian evidence evaluation, 
at which it is more efficient than MCMC since it transforms 
the multidimensional evidence integral into a one dimen- 
sional problem. This allows evidence to be evaluated from a 
single chain rather than multiple chains as is the case with 
traditional MCMC. Nested sampling can also be used for pa- 
rameter estimation since it produces posterior inferences as a 
by product. Here we use the Multincst i mplement ation of the 
nested sampling algori thnjfl (Fe roz. Hobson fc Bridg es 20091; 
iFeroz fc HobsonI bOOSj ). There are other packages available, 
with different variations of the algorithm, such as Cosmonest 
IMukheriee. Parkinson fc Lid dlc 2006). 

The likelihood for each link in the chain is calculated 
from the for that point and fed into the algorithm 

(P-f"(fcx,fc||)-P'™'(fcx,fc||))' 



= E 



'(fcx,fe||) + i 



(22) 



exp 



-X 
2 



(23) 

We ran chains for a l(/i"-^Gpc)^ and a 20(/i"^Gpc)^ 
"DESpec/BigBOSS" style survey, with shot noise n = 5 x 
Mpc~^/i on our toy model power spectrum and on our nu- 
merically produced power spectrum. 

CMB information was then included through impor- 
tance sampling. Importance sampling enables us to combine 
informati on from different (independent) experiments (eg 
Bwanson, Percival fc Lahavl (jkllO). Given a pre calculated 
Monte Carlo chain of parameter values 9 which explores a 
likelihood distribution L, we can compute parameter con- 
straints relative to a similar distribution L' by reweighting 
the sample according to the likelihood ratio 



L'{9') 



(24) 



http://ccpforge.cse.rl.ac.uk/gf/project /mult inest/ 



For now we are interested in combining CMB and galaxy 
power spectrum information but we could combine any two 
independent experiments in this way. For two independent 
likelihood distributions, L and L', the combined likelihood 
distribution L is just the product of the two 

L{e) = L{e)L'{e). (25) 

Which then allows us to simply reweight each point in the 
chain 

^ L'\e')wl. (26) 

Now likelihood contours drawn in the parameter space will 
represent the combined likelihood. It is worth noting that 
the 'Metropolis Hastings' and 'Nested sampling' algorithms 
use different weighting schemes, MCMC points have integer 
weights whilst the sum of the weights in a Multinest chain 
is one. For ease of comparison it is necessary to normalise 
the weights accordingly. 

We took a WMAP chain which allowed for a time vary- 
ing dark energy equation of state and then calculated the 
likelihood that each point fitted the galaxy power spectrum 
using equation 1231 We then reweighted each point using 
equation [26] and the WMAP weight. The Alcock-Pazcynski 
C|| , Cj_ and redshift space distortion [3 parameters were then 
calculated for each point in the chain by comparing with the 
fiducial parameter values. This allows us to draw constraints 
in these planes (figure [S]). 

To include a prior on the sound horizon in our toy model 
we assume the likelihood distribution around the true value 
of is a Gaussian of the same width as the prior used in 
the Fisher analysis. This is then importance sampled with 
the toy model, giving rise to the black contours in figures [2] 
and 121 To include the Gaussian prior in the CAMB model 
we calculated rs for each point in the ch a in using the pre- 
script ion given in IHu fc Sugivamal (| 19961 ') : lEisenstein fc Hul 
l|l998t ). We then drew a probability for this value from a 
Gaussian distribution peaked at the fiducial value for cal- 
culated using our fiducial parameter set in table 1, with 
the same standard deviation as before. Each point was then 
reweighted using equation 1261 giving rise to the black con- 
tours in figure [5l 



6 RESULTS 

When comparing the 68% confidence limits in figure [2] from 
the Fisher (blue) and nested sampling (green) of the toy 
model we can see that for smaller survey areas there is some 
tension between the two methods. The areas bounded by the 
contours are similar, with the exception of the constraints 
around /3, but the two sets of contours do not cover exactly 
the same part of parameter space. The larger survey (figure 
[3| is, however, in better agreement. Although in the /3 — 
C|| , cx planes the areas covered by the contours still differ by 
about a factor of two. 

By design the maximum likelihood is the same in both 
cases, so the question arises, what is causing the disagree- 
ment? The answer can be found by examining the likelihood 
space in more detail. Let us look at the normalised Id profile 
of the parallel Alcock-Paczynski parameter, cy. We can see 
that for a l(/i~^Gpc)^ survey the likelihood is not Gaussian 
(figure[6]), being skewed. This puts asymmetric error bars on 
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0.98 1 1.02 1.04 1.06 



Figure 5. Constraints on /3 and cx, cy from nested sampling 
a CAMB power spectrum. Tlie black ellipse represents the Icr 
confidence limit based on a 20(/i-lGpc)3 'DESpec/BigBOSS'- 
like survey alone. The purple contour is the Icr confidence limit of 
the prior from WMAP projected onto this parameter space. The 
blue contours are the Icr constraints from 'DESpec/BigBOSS' 
importance sampled with the 1% gaussian prior on ra. The red 
contours are constraints from WMAP importance sampled with a 
'DESpec/BigBOSS' CAMB power spectrum projected onto this 
parameter space. 



our parameters. When we look at the contours for a larger 
survey (the kind of volume expected for fu- 
ture spectroscopic surveys such as DESpec and BigBoss) we 
see that the spaces covered by the two sets of contours are 
more similar. If we look at the Id probability distribution 
around cy again we can see that it is more Gaussian than 
m the l{h-^Gpcf case since the marginalised probabilities 
and mean likelihoods are more similar. This is a pattern 
which is repeated for all the parameters in our toy model. 
The larger survey enables us to look closer into the peak 
of the distribution, which is more Gaussian. As anticipated, 
prior knowledge of the location and uncertainty in the sound 
horizon at last scattering improves our ability to constrain 
the shape of the power spectrum and geometric distortions. 

By adding a 1% prior onto the sound horizon of 
(Jr^ — 1.4Mpc/i~^ in a Fisher matrix analysis we can see 
how calibrating with the CMB improves constraints in our 
Cx — C|| parameter space for a small l(/i~^Gpc)'^ survey by 
a factor of several and larger 20(/i~^Gpc)^ survey by more 
than a factor of two. 

We then conducted a likelihood analysis using nested 
sampling on our numerically produced power spectrum. 
A Gaussian prior on the sound horizon improves these 
constraints by a factor of two for the 20(/i~^Gpc)'', 
'DESpec/BigBOSS'-like survey. The combined constraints 
from the 20(/i~^Gpc)^ survey -I- WMAP are nearly a fac- 
tor of 10 times better than the constraints from the survey 
alone. The WMAP prior contains considerably more infor- 
mation and is more orthogonal than the Gaussian prior on 
rs- 
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Figure 6. Normalised probability distributions of cy using our 
toy model for a l(/i~^Gpc)'' survey with shot noise, n = 5 X 
10~'*(/i~^Mpc)~'^. The black solid line is the Gaussian probabil- 
ity distribution based on the Fisher matrix analysis. The solid 
green line is the Nested Sampling likelihood marginalised over all 
other parameters but with no prior on the sound horizon. The dot 
dashed blue line is the normalised Id marginalised Gaussian prob- 
ability distribution including a prior on the sound horizon. The 
dot dashed red line is the Nested Sampling likelihood marginalised 
over all other parameters and importance sampled with a Gaus- 
sian prior on the sound horizon. 



Figure 7. Normalised Id probability distributions of the line 
of sight geometric distortion parameter C|| using our toy model 
for a 20(h-^Gpc)3 'DESpec/BigBOSS' survey with shot noise, 
n = 5 X 10~*(/i~^Mpc)~^. The black solid line is the Gaussian 
probability distribution based on the Fisher matrix analysis. The 
solid green line is the Nested Sampling likelihood marginalised 
over all other parameters but with no prior on the sound horizon. 
The dot dashed blue is the Id marginalised Gaussian probabil- 
ity distribution including a prior on the sound horizon. The dot 
dashed red line is the Nested Sampling likelihood marginalised 
over all other parameters and importance sampled with a Gaus- 
sian prior on the sound horizon. The nested sampling probability 
distribution is more Gaussian than the l(/i~^Gpc)^ case. 



7 DISCUSSION 

In this paper we have addressed two issues of great interest 
to current research in Cosmology, in particular in connec- 
tion with the forthcoming generation of large galaxy red- 
shift surveys. The first issue is physical - the callibration of 
the BAO using the sound horizon in the CMB. The second 
issue is statistical - a comparison of parameter fitting using 
Fisher matrix and Monte Carlo techniques. 

We examined if the degeneracy between geometrical 
(Alcock-Paczynski) and dynamical (redshift space) distor- 
tions in the pattern of the galaxy distribution could be 
lifted. The key point here is to use the BAO in the pat- 
tern of the galaxy distribution as a feature of known phys- 
ical size, the sound horizon ~ 150Mpc. We have shown, 
using a toy model, that this indeed can be callibrated us- 
ing the equivalent feature in the CMB. By adding a prior 
onto the sound horizon of ~ 1% we have shown, using a 
Fisher matrix analysis, error ellipses for line of sight and 
tangential distortion parameters shrink by by a factor of two 
for a 20(/i-^Gpc)^ 'DESpec/BigBOSS'-like survey with shot 
noise. This improvement is even more marked in smaller sur- 
veys, for l(/i~^Gpc)^ the improvement is nearly a factor of 
10. When our model for the power spectrum becomes more 
complicated (using CAMB) and we use the same Gaussian 
prior of ~ 1% on the sound horizon the improvement of the 
ellipse is by a factor of 2.5. When we start to include real 
data (from WMAP), which contains more information than 



just the sound horizon scale, then the improvement of the 
error ellipse is by a factor of 21. 

On the statistical side we have carried out a Monte 
Carlo Nested Sampling analysis on our parameter space. 
We find that Monte Carlo and Fisher methods can agree 
reasonably well for large volume (~ 20(ft~^Gpc)'') surveys 
with parameter constraints being nearly symetric and 68% 
likelihood contours agreeing to within a factor of two. How- 
ever, for smaller volume surveys the difference between con- 
straints from the two methods can be greater with the areas 
of 68% likelihood contours differing by more than a factor 
of four. The Nested Sampling analysis demonstrates that in 
general the likelihood distributions in the parameter space 
our toy model can be non-Gaussian. This undermines a cen- 
tral assumption in the Fisher matrix analysis, where errors 
are Gaussian by construction. All the likelihood distribu- 
tions are strongly skewed, with the exception of that around 
p. This likelihood falls off more steeply than a Gaussian, 
having kurtosis. In this example better knowledge of does 
not help to improve constraints on j3. 

The Fisher constraints on the amplitude of the power 
spectrum for a l{h~^Gpc)^ survey demonstrate the unphys- 
icality of the Fisher formalism. Allowed within the 68% con- 
fidence limits are negative and zero values for A* . Clearly in 
this case the Fisher information can only provide us with an 
upper limit. The Monte Carlo analysis does provide us with 
an upper and a lower limit on the allowed values of A* . 
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For most pairs of parameters the Monte Carlo and 
Fisher constraints are both markedly improved and more 
similar to each other with the addition of the prior on Vs. 
This is likely to be because the addition of a Gaussian 
prior makes the combined distribution more Gaussian, in 
the Monte Carlo case. There is no reason to assume, how- 
ever, that CMB derived constraints on the sound horizon 
will be Gaussian. 

The discrepancy between the Fisher likelihood space 
and perhaps more reliable Monte Carlo likelihood space is 
not a trivial point. When constraining the parameters of 
the toy model we allowed the parameters to vary indepen- 
dently but since the distortions are dependant on cosmology 
they cannot truly be varied independently of the shape of 
the underlying dark matter power spectrum, crucially in the 
context of this paper, this includes the shape and scale of 
the BAO. 

There is a large difference between the improvement of 
constraints when a Gaussian prior on the sound horiazon is 
used and when we start to include real data (from WMAP). 
This could be because our parameterisation is different but 
is probably just because nature is more complex than a toy 
model. The CMB and also the real world galaxy power spec- 
trum contain complexities and subtleties which are poorly 
understood. 

Importance sampling relies on the two experiments hav- 
ing similar likelihoods, i.e. the chain used covers most of the 
likelihood volume in both experiments. If it is the case that 
the experiments provide highly orthogonal information then 
importance sampling is not appropriate because only the 
peak of the secondary distribution will be explored. This 
will give rise to erroneously tight constraints. This may be 
cause to doubt the constraints we have derived by impor- 
tance sampling the WMAP chain. It may be the case that if 
the WMAP chain sampled a wider area then the combined 
constraints would be broader. 

Even relatively fast Monte Carlo algorithms such as 
Nested Sampling are slower and more computationally de- 
manding than Fisher matrix methods. Therefore the Fisher 
information remains a useful starting point when exploring 
the likelihoods of cosmological parameters. Further under- 
standing of the affect of CMB callibration could be attained 
by expanding the cosmological parameter set used to de- 
scribe geometrically and dynamically distorted large scale 
structure measurements. 

While joint analysis of cosmological data sets can be 
carried out as a 'black box' we have attempted here to gain 
insight and intuition into the interplay between large scale 
structure and CMB data, and to quantify the limitations of 
the Fisher matrix approach relative to Monte Carlo tech- 
niques. This approach can be extended further by more de- 
tailed modelling of e.g. non-linear redshift distortions and 
biasing, and on the statistical testing of different MCMC 
codes. Given the huge human and financial efforts invovled 
in the next generation of galaxy surveys it is crucial to un- 
derstand the physical effects and on the other hand to make 
sure that the statistics are robust. The method presented 
here can be directly applied to the next generation of spec- 
troscopic surveys such as DESpec and BigBOSS combined 
with Planck. 
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APPENDIX 

Below are thge logarithmic derivatives used to build the 
Fisher matrix of the toy model of the observed power spec- 
trum presented in section [21 
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where k — \/(fc||/c||)^ -I- (fcx/cx)^ and P' is the first 
derivative of the power spectrum with respect to k. 



